Degree and component size distributions in generalized uniform recursive tree 
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We propose a generalized model for uniform recursive tree (URT) by introducing an imperfect 
growth process, which may generate disconnected components (clusters). The model undergoes an 
interesting phase transition from a singly connected network to a graph consisting of fully isolated 
nodes. We investigate the distributions of degree and component sizes by both theoretical predictions 
and numerical simulations. For the nontrivial cases, we show that the network has an exponential 
degree distribution while its component size distribution follows a power law, both of which are 
related to the imperfect growth process. We also predict the growth dynamics of the individual 
components. All analytical solutions are successfully contrasted with computer simulations. 

PACS numbers: 89.75.Hc, 89.75.Fb, 02.50.Ey, 05.10.-a 



Degree distribution and component (cluster) size dis- 
tribution are two important properties [1[ of complex net- 
works, which have become a focus of attention for the 
scientific community fH, H, 0, [E] ■ It has been established 
that degree distribution has an important consequence on 
other structural characteristics of networks [H, such as 
average path length, clustering coefficient, betweenness 
centrality, and among others. Particularly, degree dis- 
tribution has also profound effects on almost all aspects 
on dynamic processes taking place on networks, includ- 
ing robustness Q , percolation [ t, ^1 , synchronization 9] , 
games epidemic spreading [ll|, and so on. On the 
other hand, as a fundamental structural feature of net- 
works, component (especially the giant component) size 
is one of the most significant measures of the network ro- 
bustness [1, in particular, component size distribu- 
tion is directly related to many significant practical issues 
such as the distribution of the sizes of disease outbreaks 
for disease propagation on contact networks [H, IH, ■ 

A wide variety of network models has been proposed 
to describe real-life systems and study their structural 
properties 0j S S IMi among which uniform recursive 
tree (URT) is perhaps one of the earliest and most widely 
studied simplest models. The URT is a growing tree con- 
structed as follows: Start with a single node, at each in- 
crement of time, let a new node be added to the network 
linked to a randomly selected preexisting node, ft has 
found applications in several areas. For example, it has 
been suggested as models for the spread of epidemics [l^ , 
the family trees of preserved copies of ancient or medieval 
texts 16], chain letter and pyramid schemes [l7j . to name 
but a few. 

Most previous models belong to idealized models 0, 
Hi 0, IE] I although they may provide valuable insight into 
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reality. In fact, real- world systems may be viewed as im- 
perfect realizations of idealized theoretical models p^ . 
which have significant influences on their features and 
function. For instance, introducing defects (impurities) 
into magnetic materials may drastically change its prop- 
erty [l^ . Recently, many authors have focused on inves- 
tigating the influence of deleting nodes (or edges) on the 
function of networks such as their integrity and commu- 
nication ability [i,|l,|23|- 

In this paper, we present a generalized model of URT 
by introducing an imperfect growth process which may 
generate disconnected components. The proposed model 
is governed by a tunable parameter q. We study both 
analytically and numerically the interesting characteris- 
tic aspects of the models, focusing on degree distribu- 
tion, component size distribution, as well as the evolving 
dynamics of individual componets. We show that both 
distributions can be computed accurately, although cal- 
culation of component size distribution seems difficult for 
previously studied models. The obtained results indicate 
that the imperfect growth process affects fundamentally 
the structure of the network. 

The proposed generalized model with an imperfection 
growth process is generated in the following way. We 
start from an initial state [t = 0) of one isolated node. 
Then, at each increment of time, a new node is added, 
whose behavior of growth dynamics depends on a pa- 
rameter q. With probability q, the new node keeps iso- 
lated; and with complementary probability 1 — q, the 
newly introduced node connects a randomly chosen old 
node. This growing process is repeated until the network 
reaches the desired size. One can easily see that at time 
t, the network consists of t+1 nodes and expected {l — q)t 
edges. Thus, when t is large, the average node degree at 
time t is approximately equal to a constant value 2(1 — g). 

There are two limiting cases of the present model with 
URT as one of its particular case. Therefore, we call 
the presented model "generalized uniform recursive tree" . 
For (7 = 0, the model coincides with the uniform recursive 
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tree which is a singly connected network. When q = 1, 
the network is reduced to a fully disconnected network. 
Thus, varying q in the interval (0,1) allows the formation 
of disconnected clusters as part of the growing dynamics. 

Below we will show that some interesting characteris- 
tics (i.e., degree distribution, component size distribution 
and evolutionary dynamics of individual components) 
may be investigated analytically, which depend on the 
parameter q, i.e., the imperfect growth process. 

First we focus on the degree distribution. For q = 1, all 
nodes have the same number of connection 0, the network 
exhibits a completely homogeneous degree distribution. 
For the case of < g < 1, we can address the degree dis- 
tribution using the master-equation approach [21|. For 
simplicity, we label nodes by their time of birth, so that 
node s refers to the node introduced at time s, and use 
p{k, s, t) to denote the probability that at time t the node 
s has a degree k. Then we may analyze the dynamics 
through a set of master equations [21, 22, 23] governing 
the evolution of the degree distribution of an individual 
node, which are of the following form 



p{k, s,t+l) 



1 



t+l 



p{k - l,s,i) + 1 



1 



t + l 

+ qSkfi + (1 - q)hs 



p{k,s,t) 
(1) 



with the initial condition, p(fc, s — Q^t — Q) — Skfi- This 
accounts for two possibilities for a node: first, with prob- 
ability it may get an extra edge from the new node, 
and thus increase its own degree by 1; and second, with 
the complimentary probability 1 — Trpfi node may 
remain in the former state with the former degree. 

Based on Eq. ^ , one can obtain the total degree distri- 
bution Pt{k) specifying the probability that a randomly 
chosen node has degree k at time t: 



Pt{k) 



1 * 
^^p(fc,s,i). 



(2) 



s=0 



Summing up both sides of Eq. ([T]) over s, we get the 
following master equation for the degree distribution: 

(t + 2)Pt+i{k)-{t+\)Pt{k) (3) 
= (1 - q)Pt{k - 1) - (1 - q)Pt{k) + <z4,o + (1 - q)5ks- 

In the infinite t limit, each Pt{k) converges to some lime 
P{k). Then, the corresponding stationary equation takes 
the form 

(2 - q)P{k) - (1 - q)P{k - 1) = q5kfl + (1 - g)^,!- (4) 
It has the solution of an exponential form 

(5) 
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For q = 0, Eq. (O is exactly the same degree distribution 
of the uniform recursive tree [sj . 




FIG. 1: (Color online) Semilogarithmic graph of degree dis- 
tribution of the networks with order A'' — 200000. Each point 
is an average over 10 independent simulations. The solid lines 
are the analytic calculation values given by Eq. ([SJ. 



In Fig. [U we report the simulation results of the degree 
distribution for several values of q, from which we can see 
that the degree distribution decays exponentially with 
the degree, in agreement with the analytical results. 

Next, we show that the distribution of component sizes 
follows a power law with exponent determined by param- 
eter q. For the case of g = 0, there is only one compo- 
nent in the network, and the component size distribution 
is trivial. For g = 1, there are t+l components, every 
node belongs to a component of size 1 (the node itself). 
In the range < g < 1, the imperfect growth process al- 
lows the formation of multiple disconnected components. 
For this case, we can derive an analytical expression for 
the com pon ent size distribution using the rate-equation 
method [2^ . 

Initially {t — 0), there is only one component in the 
network. At each subsequent step, a new cluster is cre- 
ated with probability q. Thus, after t step evolution, 
there are expected N{c,t) — tq + 1 components in the 
network, and the average value of component sizes is 
asymptotically equal to i for large t. Let Nc{t) denote 
the average number of components with size c at time 
t. By the very construction of the generalized uniform 
recursive tree, when a new node enters the network, the 
rate equations [l^, [13] that account for the evo- 

lution of Nc(t) with time t are 



dN,{t) 
dt 



(l-g) 



{c-l)N,^i{t)^cN,{t) 
Ec' c'N^'it) 



+ q5c,i. (6) 



Here the first term on the right-hand side of Eq. ([6]) ac- 
counts for the process in which the new node is connected 
to one node in a cluster with size c — 1, leading to a gain 
in the number of components with size c. Since there are 
Nc-i{t) components of size c— 1 and the new node creates 
a new edge with probability 1 — q, such processes occur 
at a rate proportional to (1 — q){c — l)Nc-i{t), while the 
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FIG. 2: (Color online) Double-logarithmic plot of the com- 
ponent size distribution of the networks with 200000 nodes 
each. The points are the results of computer simulations, and 
each point is obtained by 20 independent network realizations. 
The solid lines correspond to the analytic solution provided 
by Eq. (|8}. 



factor J2c' Nci{t) converts this rate into a normalized 
probability. The second (loss) term describes the new 
edge connecting to one of the nodes in a cluster with size 
c turning it into a component with size c -(- 1. The last 
term on the right-hand side of Eq. ([6]) accounts for the 
continuous introduction of a new component with size 1 
(the isolated new node itself). 

Let P{c) be the component size distribution that is the 
probability of a randomly chosen cluster having size c. In 
the asymptotic limit Nc{t) = N{c,t)P{c) ~ tqP{c) and 
c' Nc'{t) — t + 1. Inserting these into Eq. we 
have the following recursive equation 




givmg 



Pic) 



n 



1 r(c)r(2 + ^ 



2 - g TO 

m—2 



1 

1-9 



for c > 1. Thus, in the infinite c limit 

P(c) ~C-(l+T^), 



(8) 



(9) 



which shows that component size distribution follows a 
power-law form with an exponent 7c = 1 + jz^ dependent 
on parameter q. 

We have performed extensive numerical simulations for 
the full range of q between and 1. In Fig. [21 we plot 
the component size distribution P(c) as a function of q, 
which agrees well with the analytic result. 



FIG. 3: (Color online) Double-logarithmic graph showing the 
time-evolution for the sizes of two components. Here q = 0.2, 
and the dashed line has slope 0.8, as predicted by Eq. pip . 



In addition to component size distribution, using the 
continuum approach 28] we can calculate the time de- 
pendence of component size c; of a given component i, 
i = 1, 2, ■ • ■ , N{c, t). The size will increase by one every 
time a new node is added to the system and linked to 
one of the nodes belonging to this component. Assum- 
ing that Ci is a continuous real variable, according to the 
generating algorithm of the model, the rate at which a 
changes is clearly proportional to Ci itself. Consequently 
Ci satisfies the dynamical equation [2^ [2^ 



dci{t) _ g)'^^(^) 



dt 



t 



(10) 



The solution of this equation, with the initial condition 
that component i was born at time ti with size Ci{ti) = 1, 
is 



1-q 



(11) 



Equation (jlip shows that the size of all components 
evolves the same way, following a power law, the only dif- 
ference being the intercept of this power law, see Fig. [S] 
To conclude, by introducing an imperfect network 
growth process we have proposed an extended model for 
uniform recursive tree. The presented model interpolates 
between a network with fully disconnected clusters and 
the uniform recursive tree with only one single compo- 
nent, which allow us to explore the crossover between the 
two limiting cases. We have provided both analytically 
and numerically the solutions for degree and component 
size distributions of our model. We found that in the case 
of < g < 1 the model exhibits an exponential degree 
distribution and a power-law component size distribu- 
tion. We also presented that the size of all components 
evolves as a power-law function of time t. 



4 



Our model has a remarkable character that some nodes 
may become disconnected from the rest of the network. 
This property has been less reported in earlier studied 
models and thus has not yet been paid enough atten- 
tion to. Actually, our model may be further extended to 
include initial attractiveness |2i| and preferential attach- 
ment [s^. That is, all nodes are born with some initial 
attractiveness A > 0. In the growth progress, the proba- 
bility that an old node will receive an link from the new 
old is proportional to the sum of the initial attractive- 
ness and its degree. In this more general case, the degree 
distribution is power law with exponent dependent on A, 
while the component size distribution is the same as that 



of the present model addressed here and is independent of 
parameter A. Finally, we believe our model could stimu- 
late and find some obvious implications for some related 
researches such as disease propagation in future. 
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